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Abstract 

Superpixels have become prevalent in computer vision. They have been used to achieve satisfactory perfor¬ 
mance at a significantly smaller computational cost for various tasks. People have also combined superpixels with 
Markov random field (MRF) models. However, it often takes additional effort to formulate MRF on superpixel- 
level, and to the best of our knowledge there exists no principled approach to obtain this formulation. In this 
paper, we show how generic pixel-level binary MRF model can be solved in the superpixel space. As the main 
contribution of this paper, we show that a superpixel-level MRF can be derived from the pixel-level MRF by substi¬ 
tuting the superpixel representation of the pixelwise label into the original pixel-level MRF energy. The resultant 
superpixel-level MRF energy also remains submodular for a submodular pixel-level MRF. The derived formula 
hence gives us a handy way to formulate MRF energy in superpixel-level. In the experiments, we demonstrate the 
efficacy of our approach on several computer vision problems. 

1. Introduction 

Many computer vision problems can be cast as image labeling problems. Markov random field (MRF) is a 
general-purpose optimization model for image labeling [1, 2]. Recent progress on MRF shows its prominent 
advantages for solving various computer vision and machine learning problems [3, 4, 5, 6, ]. 

Superpixelization, a.k.a. over-segmentation, is an intuitive yet effective approach to reducing the dimensionality 
of the image space for computer vision problems [8, 9, 10, 11, 12, 13, 14, 15], and it has been used in combination 
with MRF [ 5, 17, 18, 19, 20, 21, 22, 2 '. ]. Superpixels can be used to speed up the image labeling and they often 
form natural regularization to the labeling problems. However, it often takes significant effort to reformulate the 
original pixel-level MRF problem into a superpixel-level MRF problem. To the best of our knowledge, there exists 
no principled approach to obtain the superpixel-level MRF. 

In this paper, we show how to minimize a given generic pixel-level binary MRF energy in the superpixel 
space. To this effect, we first represent pixelwise label by superpixel label. We then substitute this superpixel 
representation into the pixel-level MRF energy. As the main contribution of this paper, we show that superpixel- 
level MRF energy can be derived from the pixel-level MRF. In addition, the derived superpixel-level MRF is 
submodular if the original MRF model is submodular. Fig. 1 illustrates the main idea of this paper. We demonstrate 
the usefulness of our technique on three representative image labeling problems. 

The remaining of this paper is organized as follows. In the next section, we will review the generic form of 
the second order binary MRF. In section 3, we will present the technique that we used to superpixelize the MRF 
energy. In section 4, we briefly introduce the three applications we considered in this work. In section 5, we 
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Figure 1. Superpixelizing MRF and preserving submodularity (representable via s-t graph), k and l are superpixel indices. 
Uk,Ui and 14 1 are the unary and pairwise potentials. 

present the experimental results of the respective applications with comparison to the state-of-the-art methods. In 
section 6, we conclude the paper and suggest some future works. 



2. Binary MRF model for image labeling 

The generic second order MRF model can be written as follows: 

Imii u p(fp) + A)’ (!) 

p£V (p,<?)e N 


where f p and f q are the pixel-wise labels over the image, we consider the label values to be either 1 or 0 henceforth. 
V is the set of all pixels in the image, and J\f is a neighborhood system. U p (-) is known as the unary term or data- 
fidelity term. V pq (- 1 •) is the pairwise potential that is often used to model the pairwise relationship between the 
labels on neighboring pixels. 

For binary-label problem, the unary term can be written more explicitly as 


Up(fp) 


Wp-, i f f P = 1 
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Up(fp) = W l pfp + w°{ 1 - fp) = (Wp - W° p )fp + 71!p. 

The generic form of the pairwise term can be written as 
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Figure 2. Superpixel (cake-cutting) representation of 0-1 labeling 


To sum up, we may rewrite the generic binary label MRF model explicitly as follows: 

mm^2w p f p + ( w pqfpfi + w pqfpfi 

1 pe v (p,q)eM (5) 

+ w lqfpfq + w llfpfq ) • 

where w p = w p — w p . Note that we have omitted the constant terms. 

It has been proven in [24] that if w p ® + w p ^ < w p ^ + w p q, the binary labeling problem is submodular and hence 
can be solved by graph cuts exactly. We will focus on submodular MRF model in this paper. 

3. Superpixelizing MRF 

3.1. Superpixel representation of pixel labeling 

Superpixels are essentially adjacent and non-overlapping image regions. We can denote each superpixel k by 
one indicator function x k defined on the entire image domain, and the superpixel indicator function \ k would 
satisfy: 

Xp = 1, if P e ttk, and XpX l p = 0, (6) 

where we concatenated \ k to be Xp = X k ( z p)> z p is the pixel location, and Xlp is actually the set of all pixels 
belonging to the k- th superpixel. 

Based on the above representation of superpixels, the pixelwise labeling over the image can be represented 
using the superpixel labels as 

K 

/p = I>fex£, (7) 

k =1 

where we considered the concatenated form of pixel labeling f p = /( z p ), xp is the superpixel label. This super¬ 
pixel representation of image labeling is also illustrated in Fig. 2, where we only consider two superpixels, and the 
label value is either 0 or 1. 

We now derive some basic properties from this superpixel representation of image labeling. These properties 
will be useful in the derivation of the superpixel-level MRF energy. 

Lemma 3.1 For any p E ftp f p = xix p = xi where Xli = {p\x p = 1}. 

The above lemma implies the following property. 

Corollary 3.2 f p = Y,k=i %kXp 
We defer their proofs to Appendix. 
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3.2. The derivation of superpixel-level MRF 

With the superpixel representation of image labeling, we are able to write down a naive form of superpixelized 
MRF energy minimization problem: 


min Ei(f) + E 2 (f) 
f,x 


K 

s-t.: f p = ^x k Xp, peV, 
k =1 


( 8 ) 


where E\ and E 2 are the total unary and pairwise potential terms in the original MRF energy, and x = {xp\k = 
1,2,..., K} is the set of all superpixel labels. 

This problem is equivalent to 


min 

X 



(9) 


The above discrete optimization problems may appear to be difficult to solve. 

The main contribution of this paper can be written as a proposition as follows. 

Proposition 3.3 Given that f p = 1 x kXp> the energy in Eq. (9) can be written as an MRF energy defined on 

superpixels, namely 


K K,K 

y^cbkXk + + uux k xj 

k =1 {k=l,l=l\k^l} 

+ (Jj^x k xl + (Jj\\x k xi) , 
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where the first term is the unary 
uk - u k = Y,p w px k p > 


term, i.e. Up, and the second term is the pairwise potential, i.e. 
= E w ™XpXq> and uffp = w™x k v X l Q > {m, n 
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The proof of this proposition is deferred to the Appendix. Eq. 10 gives us a formula which relates the MRF 
energy between superpixel and pixel explicitly. With this formula we can build the MRF for superpixels using the 
MRF in pixel level regardless of the underlying applications. To understand the resultant pairwise potential more 
in-depth, we elaborate on the relationship between the pairwise potentials before and after superpixelization. 
According to Eq. (5), the pairwise potential for the pixel-level MRF can be written as: 


V p q — W p qfpfq + Wpqfpfq + w pq fpfq + W pqfpfq (H) 

Likewise, the pairwise potential for the superpixel-level MRF can be written as: 

Vkl = UkiXj^i + a i k jx^xi + u$x k xi + w\}x k xi. (12) 


Corollary 3.4 Given the pairwise potentials defined in Eq. (11) and Eq. (12), we have the following relationship 
between them: 

Vkl = Eu V P1’ f orp e ^ k and (13) 

{p,q}£Af 

where Xl k and (2/ are different superpixels. 
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Figure 3. Visualization of the relationship between the pairwise potential of superpixel-level and pixel level in corollary 3.4. 
Note that the other two types of pixel-level pairwise potentials will contribute to the superpixel-level unary term as shown in 
proposition 3.3. 


We illustrate the construction of the pairwise potential in Fig. 3. In addition, to ensure the solvability of the 
resultant problem, it is important to maintain the submodularity of the superpixel-level MRF model. We find that 
the derived superpixel-level MRF is indeed submodular if the original pixel-level MRF is submodular. 


Proposition 3.5 If the pairwise potential satisfies the regularity inequality, namely 


w m + w 11 < w 01 + w w 

^pq ' ^pq — ^pq ' ^pq ) 


then the following inequality holds as well. 


,,°° . , ,n <- , ,oi , , ,io 

u kl + u kl — U kl + u kl ^ 


(14) 


(15) 


The proof of this proposition is deferred to the Appendix. 

Comparing with the original MRF model in Eq. (5), the superpixel MRF in Eq. (10) requires significantly 
smaller graph for the same problem. 

3.3. Superpixelizing the Potts model 

One common form of binary MRF energy is the Potts model as follows: 

min E w pfp+ ^2 w pq\fp - fq\ 2 - (16) 

P (p,g)eJV 

We are particularly interested in the superpixel energy form of the above energy. First, we can rewrite the 
energy in the general form as in Eq. (5). Let E 2 ° tts = q )<=N w pq\fp ~ fq\ 2 > we have: 

E P°tts = Wpq \f p -f q \ 2 

(p,q)eJ\f 

’ [ - - X (17) 

— / ^ y w pqfpfq "E w pqfpfqJ • 

(p,q)eJ\f 
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Thus, the corresponding superpixel MRF is the following: 

K K,K 

min E T ^ ^ (^kl%k%l T ^kl’^k’^l^j 

X k =1 jfc=l,Z=l 

K K,K 

min E ^k%k + ^ + UkiXkX^j (18) 

fc=i k^i 

K K,K 

4=> min E ^ ^jfezl^jfe — #z| 2 j 

fc=l k+l 

where wf q = = 0, u k = J2 P W P X P and w kl = J2 pq w P qX P X q - 

3.4. Superpixel MRF for segmentation with detected edges 

It has been shown that the segmentation with an MRF model can be made very effective for object segmentation 
if the detected edge is incorporated in the model [25]. The main contribution in their model is using edge map to 
form the pairwise potential in the Potts model as follows: 

V P q(f P ifq)= w P q\f p — fq\ ? (19) 

where f p and f q are the label variables, they are either 0 or 1, and w^ q is defined as: 

_( exp(- 5 / e (p,?)), Iefaq) 7^0 

™ ~ \ 20, Otherwise ’ V U; 

in which / e (p, q) is 1 if either p or q is on edge [26, 27]. 

As their method targets at automatic object segmentation, computational efficiency is a critical concern. We 
propose to superpixelize their MRF energy to gain similar performance of segmentation at a much smaller compu¬ 
tational cost. Note that it is also not straightforward to reasonably incorporate edge detection in an MRF defined 
on superpixels. 

Interestingly, Ren et al. [21] proposed a superpixel MRF with detected edge. Nevertheless, the explicit relation¬ 
ship between the superpixel-level and pixel-level pairwise potential was not given. Thus, the optimal formulation 
for this term may be obscure. With our superpixelization formula for Potts model established in Eq. (18), the 
explicit form of the pairwise potential for the edge based superpixel MRF can be easily derived from Eq. (19). 

4. Applications 

In this section, we briefly review the applications that we considered in the experimental evaluation. 

4.1. Interactive image segmentation 

Interactive image segmentation is a typical application of MRF model [28]. It has been successfully incorpo¬ 
rated in the system of image cutout [29, 31 ]. The image cutout is now composed of three components, object 
masking, boundary editing and alpha-matting [29]. In this paper, we consider the basic module of an interactive 
segmentation system, i.e. the box and seeds controlled object masking. Recent developments on MRF model 
based interactive image segmentation are mainly focused on the unary term [31, 32, 33]. Since the superpix¬ 
elization of the unary term is relatively straightforward, in this work we consider the effectiveness of our MRF 
superpixelization for the state-of-the-art pairwise potential [25]. 
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4.2. Segmentation propagation in video cutout 

Interactive video cutout is a useful tool in video editing and compositing [34, 35, 36]. It usually begins with an 
interactive key frame segmentation, followed by segmentation propagation. The segmentation propagation step 
automatically generates segmentations of the subsequent frames by motion estimation, foreground-background 
classification and MRF based optimization. On the one hand, since the video cutout is usually a tedious work, 
the efficiency of the segmentation propagation step is crucial to the usability of such system. On the other hand, 
accuracy is of utmost importance in video cutout. In other words, the computational cost should not be reduced at 
any cost of accuracy. We propose to superpixelize the original MRF model in segmentation propagation to safely 
reduce the computational cost. 

4.3. Automatic segmentation proposal generation 

Automatic segmentation proposal generation is a relatively new topic in computer vision [37, 38, 39, 23]. It 
aims to integrate the object detection with object segmentation. The main idea is to generate a pool of segmentation 
results, as a substitute to sliding windows, to feed into the object detector. The major challenge is that this method 
can result in very high computational cost. Normally, thousands of proposals will be generated for each image to 
ensure a satisfactory recall [23]. Although superpixels have been adopted to reduce the computational burden in the 
existing frameworks, the relationship between the formulated MRF models for segmentation with superpixels and 
advanced celebrated pixel-level MRF models [25, 40, 41] remains mysterious. The state-of-the-art segmentation 
methods are generally working on pixel level [ ,5, 32]. Thus, we propose to superpixelize the existing state-of-the- 
art pixel-level MRF, such as in [25], for generating object proposals. Witnessing the effectiveness of the pixel-level 
MRF models, we can expect the similarly successful object proposal generation with the superpixelized MRF. 

5. Experiments 

In this section, we evaluate our method for the aforementioned applications. The methods are implemented 
using MATLAB. We will release our code and datasets upon acceptance. For preprocessing, we adopt a classic 
edge detection method [4z ] and a popular superpixelization method [ 2]. 

5.1. Datasets and evaluation metrics 

Interactive image segmentation. For evaluating our methods with bounding box input, we adopt the dataset 
used in [31]. It is a subset of the Weizmann segmentation dataset, and it contains 100 images with relatively strong 
object-background contrast. We use the bounding box provided with the dataset followed by seeds input generated 
with the robotuser [43] as the user input. 

We measure the performance of the methods with segmentation accuracy and the corresponding user effort to 
achieve the accuracy. The segmentation accuracy is defined as the overlapping ratio between the result and the 
ground truth, i.e. size(H 0 D 77*)/ size(H 0 U 77*) where H 0 is the segmentation result and 77* is the ground-truth 
segmentation. The user effort is measured by the total geodesic distance of the seed points, i.e. the sum of the 
minimal pairwise distance over the point set. In this experiment, the number of superpixels is around 800 for all 
the images. This set of experiments were conducted on a PC with Intel Core i5-450M (2.4GHz) processor and 
8 GB memory. 

Segmentation propagation in video cutout. There is one benchmark dataset for interactive video cutout [36]. 
In the experiment, we evaluate our method on their testing sequences which consists of 6 video sequences with 
2070 frames. Since the video cutout task tolerates very little error, we measure the performance of the methods 
using the boundary deviation , i.e. the average distance from the object boundary in the segmentation result and 
the ground truth object boundary. We also use more superpixels, around 3200, in this experiment. This set of 
experiments were conducted on a PC with Intel Core i7-4700MQ (2.4GHz) processor and 32GB memory. 
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AdaFBC [3 ] AdaFBC + S-SPGC AdaFBC + Aseg on pixels [25] Our method 


Figure 4. Two sets of results for interactive image segmentation. The top rows show the initial box and the seeds provided 
by robot user. The images containing seeds have been whitened. The images are better seen by zooming in. Notice that our 
method produces similar or better results with significant less user effort. 

Segmentation proposal generation. In this experiment, we use the code shared with [2: ]. We evaluate on the 
same test dataset they experimented on, which is part of the PASCAL VOC 2012 segmentation challenge. In the 
comparison we did not include superpixel refinement even it was proven useful for the task. In brief, we directly 
use the SLIC [cite] in the comparison and replace the pairwise potential of the superpixel MRF constructed in 
[23] with the pairwise potential superpixelized from Eq. (19). We adopt the maximum overlapping ratio of the 
generated proposal for each object in each image as the evaluation metric. This set of experiments were conducted 
on a PC with Intel Core i7-4700MQ (2.4GHz) processor and 32GB memory. 

5.2. Results 

Segmentation with detected edges For this task, we adopt Adaptive foreground-background classification 
(AdaFBC) [31] as our foreground-background model. We use the foreground-background probability map pro¬ 
duced by AdaFBC combined with feature based superpixel MRF (SSP-GC), active visual segmentation model [25] 
(Aseg), and our method. Due to the page limit, we only present two set of visual results in Fig. 4, additional re¬ 
sults can be found in the supplementary material. Note that our method only requires one dot seed to achieve a 
satisfactory segmentation in those two examples, while the other methods require either tedious user interactions 
or produce visually noticeable artifacts. We also present the quantitative results of this experiment in Fig. 5 and 
the computation time in Table. 1. We can observe that our method is about 400 times faster than the original 
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User Effort 

classifier [31] *B*FG classifier + SSP-GC 

^^*FG classifier + Aseg [251 ^^FG classifier + Our method 

Figure 5. Ground truth comparison of segmentation score v.s. user efforts with initial bounding boxes. 


Table 1. Computation time for interactive segmentation (seconds per image). 


method 

mean 

std 

min 

median 

max 

ASeg [2i ] 

0.087 

0.0019 

0.086 

0.087 

0.094 

SSP-GC 

0.0081 

0.0014 

0.0063 

0.008 

0.012 

Our method 

0.0026 

0.00024 

0.0024 

0.0026 

0.0036 


pixel-level method [2f ]. Our method is also faster than SSP-GC. This is perhaps because the sparse edge map 
gives good contrast to the MRF weights in our model, which makes the inference much easier and faster. 

Segmentation propagation in video cutout In this experiment, we compare our method with the segmentation 
propagation adopted in [36] and [35]. The latter is known as Rotobrush in Adobe After Effect. We adopt the state- 
of-the art foreground-background classifier proposed in [3 ] to form the unary term in the MRF. While Zhong 
et al. [36] adopted matting for segmentation propagation, the Rotobrush uses graph cuts to solve a conventional 
MRF based segmentation model. In our implementation, we still adopt the model proposed in [25] for this task. 
We present one set of visual results in Fig. 6, more results can be found in the supplementary materials. The 
quantitative results are summarized in Fig. 7. From the results, we can observe that our method achieves the state- 
of-the-art segmentation propagation results. The advantage of our method lies in the computational efficiency, as 
tabulated in Tab. 2. 

Automatic segmentation proposal generation Again, we only present some results of the segmentation pro¬ 
posal generation in Fig. 8 due to the limit in paper length, additional results are in the supplementary materials. 
The visual results suggest that the results of our method better adheres to the object boundaries compared to the 
local and global search (FGS) method [23]. The quantitative results are shown in Fig. 9. From the quantitative 
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Frame 1 Frame 3 Frame 5 Frame 7 Frame 9 

Figure 6. Results of segmentation propagation for the “Car” sequence given the segmentation in the first frame. The back¬ 
ground is whitened for visualization. 
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Figure 7. Error accumulation in segmentation propagation. 


results we can observe that our method generates proposals of high accuracy at a higher probability. Table 3 
compare the computation time which is similar since both [23] and our method run on the superpixel space. 
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Table 2. Computation time for segmentation propagation (seconds per frame). 


method 

mean 

std 

min 

median 

max 

Matting [36] 

0.96 

0.3 

0.42 

0.92 

1.8 

GC [3f ] 

0.38 

0.12 

0.22 

0.35 

0.73 

Our method 

0.003 

0.0002 

0.0026 

0.0029 

0.0036 



Figure 8. Comparison of the maximum overlapping proposals 
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Figure 9. Performance of segmentation proposal generation. 

Table 3. Computation time for segmentation proposal generation (seconds per image). 


method 

mean 

std 

min 

median 

max 

LGS [23] 

10.46 

2.19 

1.38 

10.59 

20.33 

Our method 

9.37 

1.93 

1.26 

9.81 

20.43 




























6. Conclusion and future work 
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In this paper, we propose a technique to convert the generic binary MRF defined on pixels to binary MRF 
defined on superpixels, which we called superpixelization of MRF. The resultant model remains submodular if 
the original model is submodular. We applied the technique to several computer vision problems, and we either 
outperform the state-of-the-art at similar computational cost or we achieve the state-of-the-art at significantly 
smaller computational cost. Our technique is also potentially useful in solving non-submodular energy or multi¬ 
label problems and it is ready for extending to voxel labeling. 

Appendix 

Proof of lemma 3.1 Let’s consider f p defined in Eq. (7). We will have 


fp = fpXpi for p e tt[. 

Substituting Eq. (7) into the above, we will have for any p G fl; 

K 

fp = J2 x k XpXp =xix l 


k =1 


=0, if k^l 


P' 


(A-l) 


(A-2) 


Note that x l p = 1 for any P £ fp = x i- This completes the proof. | 

Proof of corollary 3.2 According to Lemma 3.1, we have for any p e $2/, f p = xi. Thus f p = xj — xix p , for 
any p G Thus for all p G V, we will have f p = J2k=i %kXp- 1 

Proof of Proposition 3.3 We may start by expanding Eq. (9). Accordingly, the unary term in Eq. (5) can be 
rewritten using xp\ 


K 


^ = 53 w pf P = 53 53 w p*p 


k= 1 


K 

Xk = 5 ' ^kXk t 
k =1 


(A-3) 


where u k = £ p w P Xp- 

To superpixelize the pairwise potential of the MRF energy in Eq. (5), we need to superpixelize the four pairwise 

terms: £ W pqfpfq> E W p\fpfq> E W lqfpfq ’ E W l l qfpfq- 

(p,q)£JX (p,q)£JX (p,q)£JX 

Therefore, we have the following identities. 


E 

W pqfpfq 

(p,q)eJ\f 

K K 

E 

w m 53 53 ^4 

(p,q)eJ\f 

k =1 1=1 

K,K 


E 

I ^ ^ ^pqXpXq I %k%l 

k=l,l=l 

\(p,q)eJ\T J 

K 

K,K 


(A-4) 


=J2 oj k° xk+ 53 

k=l {k=l,l=l\k^l} 

K K,K K 

=53 - u( k x k + 53 ^TiXkxi +53 

k =1 {k=l,l=l\k^l} k =1 


, ,00 

u k > 








CVPR reviewers said it’s too simple... 

where ^ w pqXpX l q > and = X] w pqXpXq • Note that = 1 only when the neighboring p, q 

(p,q)eJX (p,q)£JX 

are in two neighboring superpixels, i.e. p G Xlp and q G 0/ for /c ^ l. 

Likewise, 




K,K 



E 

W plfpfq = 

E 


(A-5) 

(p,q)eJ\f 


{fc=i,i=i|fc^/} 





K,K 



E 

W lqfpfq = 

E 

UuXkXi 

(A-6) 

(p,q)eJ\f 


{fc=i,i=i|fc^/} 




K 


K,K 



ii 

M 

+ 

u\}x k x u 

(A-7) 

(p,q)eN 

k= 1 

{/c= 

l,l=l\k^l} 



where ^ w %ipXpX l q> ^p 171 = X] w ^XpXq^ where (m, n) G {0,1}. Note that there are no linear 

(p,q)eJ\r (p,q)eAf 

terms for rn = 1 . n — 0 and m = 0, n — 1 , since x^xp — 0. 

To sum up, the superpixelized MRF energy can be rewritten as 


E 1 + E 2 

K K,K 

= LU k X k + {^klX^Xi + U k jx k x~l 
k =1 

+w^°a; fc x7 + 

+ C, 


(A-8) 


where C is a constant independent of xp^Cop = cc/e — + cc^ 1 , and the remaining variables are defined as before. 

The resultant form turns out to be analogous to the original pixel-level MRF. Note that co™ 71 is treated as 0 for 

m — 1, n — 0 and m = 0, n — 1, since x^x^ = 0. | 


Proof of corollary 3.4 First, we can take summation of V pq over the neighborhood defined by A4 1 = {{p, g} G 
J\f\p G q G k ^ 1} to arrive at the following: 

E V m 

{MleA4 ' _ _ _ (A-9) 

= W pqfpfq + W pqfpfq + W pqfpfq + W pqfpfq 

{p,q}eN k i 

According to lemma 3.1 and corrollary 3.2, the above can be written as: 


E ^ 

= + W m^k X l + + ™l l qX k Xl 

{p,q}eJ\f k l 


(A-10) 


From proposition 3.3, we know that 

w«" = E < A -‘» 

By substituting the above into Eq. (A-10), we obtain the LHS of Eq. (12) which complete the prove. | 
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Proof of proposition 3.5 Let us multiply each term of Eq. (14) with Xp'xl r which is non-negative. We will have 

for any (p, q) G J\f, 


W°°Y k Y l + W U Y k Y l < Up l Y k Y l + U) 1 ®Y k Y l 
^pqApAq 1 ^pq ApAq — ^pq ApAq 1 ^pq ApAq- 


w pq ApAq 


pq ApAq 


w pq ApAq • 


(A-12) 


If we further sum each term over all the (p, q) G AT together, we will have 


( W pqXpX l q + ^llXpXq) 

(p,q)eJX 


< X ( W plXpX l q + wlqXpX l q )• 

(p,q)eJ\T 

By definition of and c the above completes the proof. | 


(A-13) 
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